Nonparametric Estimation and On-Line Prediction for General Stationary Ergodic Sources
Abstract
We propose a learning algorithm for nonparametric estimation and on-line prediction for general stationary ergodic sources. The idea is to prepare many histograms and estimate the probability distribution over the bins of each histogram. We do not know a priori which histogram best expresses the true distribution: if the histogram is too sharp, the estimate captures too much of the noise (overestimation). To address this, we weight those distributions to obtain an estimate of the true distribution. As long as the weights are positive, we obtain the desired property: the Kullback-Leibler information divided by the number n of examples diminishes as n grows.
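The weighting idea in the abstract can be illustrated with a small sketch. It is not the paper's algorithm: it assumes data in [0, 1), treats observations as if they were independent (so it ignores the memory of a general stationary ergodic source), uses dyadic histograms with 2^k bins, an add-1/2 estimate of the bin probabilities, and uniform prior weights. The class name `HistogramMixture` and the parameter `max_level` are hypothetical names chosen for the example.

```python
import math
import random


class HistogramMixture:
    """Weighted mixture of dyadic histograms on [0, 1) (illustrative sketch)."""

    def __init__(self, max_level=6):
        self.levels = list(range(1, max_level + 1))
        # Positive prior weights, here uniform (an assumption of this sketch).
        self.log_w = [math.log(1.0 / max_level)] * max_level
        # One count table per histogram; level k has 2**k bins.
        self.counts = [[0] * (2 ** k) for k in self.levels]
        self.n = 0

    def _bin(self, k, x):
        bins = 2 ** k
        return min(int(x * bins), bins - 1)

    def _level_density(self, i, x):
        k = self.levels[i]
        bins = 2 ** k
        # Add-1/2 estimate of the bin probability, divided by the
        # bin width 1/bins to turn it into a density.
        p = (self.counts[i][self._bin(k, x)] + 0.5) / (self.n + bins / 2.0)
        return p * bins

    def predict(self, x):
        """Predictive density of x given the data seen so far."""
        m = max(self.log_w)  # subtract the max for numerical stability
        post = [math.exp(lw - m) for lw in self.log_w]
        total = sum(post)
        return sum(w / total * self._level_density(i, x)
                   for i, w in enumerate(post))

    def update(self, x):
        """Observe x: reweight each histogram by its likelihood, then count."""
        for i, k in enumerate(self.levels):
            self.log_w[i] += math.log(self._level_density(i, x))
            self.counts[i][self._bin(k, x)] += 1
        self.n += 1


if __name__ == "__main__":
    random.seed(0)
    model = HistogramMixture(max_level=6)
    n, log_loss = 2000, 0.0
    for _ in range(n):
        x = random.betavariate(2, 5)  # synthetic data, for illustration only
        log_loss -= math.log(model.predict(x))
        model.update(x)
    print("average log loss per example:", log_loss / n)
```

Because every histogram keeps a positive weight, the mixture's cumulative log loss exceeds that of the best single histogram by at most the negative logarithm of that histogram's prior weight, a constant that vanishes once divided by n. This is the standard mixture argument and is only meant to convey the flavor of the property stated in the abstract.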
Related resources
Compression-based methods for nonparametric density estimation, on-line prediction, regression and classification for time series
We address the problem of nonparametric estimation of characteristics for stationary and ergodic time series. We consider finite-alphabet time series and real-valued ones and the following four problems: i) estimation of the (limiting) probability P(u_0 ... u_s) for every s and each sequence u_0 ... u_s of letters from the process alphabet (or estimation of the density p(x_0, ..., x_s) for re...
Compression-based methods for nonparametric on-line prediction, regression, classification and density estimation of time series
Jorma Rissanen has discovered some deep connections between universal coding (or universal data compression) and mathematical statistics. In particular, the MDL principle has been one of the most powerful methods of modern mathematical statistics. In this paper we apply Rissanen’s approach and ideas to some statistical problems concerned with time series. We address the problem of nonparametric...
Compression-based methods for nonparametric density estimation, prediction, regression and classification for time series
We address the problem of nonparametric estimation of characteristics for stationary and ergodic time series. We consider finite-alphabet time series and real-valued ones and the following four problems: i) estimation of the (limiting) probability P(u_0 ... u_s) for every s and each sequence u_0 ... u_s of letters from the process alphabet (or estimation of the density p(x_0, ..., x_s) for re...
Multiple Change-Point Estimation in Stationary Ergodic Time-Series
Given a heterogeneous time-series sample, it is required to find the points in time (called change points) where the probability distribution generating the data has changed. The data is assumed to have been generated by arbitrary, unknown, stationary ergodic distributions. No modeling, independence, or mixing assumptions are made. A novel, computationally efficient, nonparametric method is proposed, and is...
Nonparametric Learning Capabilities of Fuzzy Systems
Nonparametric estimation capabilities of fuzzy systems in stochastic environments are analyzed in this paper. By using ideas from sieve estimation, increasing sequences of fuzzy rule-based systems, capable of consistently estimating regression surfaces in different settings, are constructed. Results include least squares learning of a mapping perturbed by additive random noise in a static-regre...
Journal: CoRR
Volume: abs/1002.4453
Publication date: 2010